Speech recognition, sylabification and statistical phonetics

نویسنده

  • Melvyn John Hunt
چکیده

The classical approach in phonetics of careful observation of individual utterances can, this paper contends, be usefully augmented with automatic statistical analyses of large amounts of speech. Such analyses, using methods derived from speech recognition, are shown to quantify several known phonetic phenomena, most of which require syllable structure to be taken into account, and reveal some apparently new phenomena. Practical speech recognition normally ignores syllable structure. This paper presents quantitative evidence that prevocalic and postvocalic consonants behave differently. It points out some ways in which current speech recognition can be improved by taking syllable boundaries into account.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dependence and independence in automatic speech recognition and synthesis

When automatically recognising or synthesising speech by computer, we are forced to make a number of assumptions of statistical independence in order to make certain problems tractable. This paper gives a few examples of how phonetic knowledge is already usefully informing these decisions about independence, and a few examples of where it isn’t, yet. Temporal integration – how information from ...

متن کامل

Speech Recognition Based on Syllable and Pseudo-articulatory Features

The prevailing approach to speech recognition is the statistical technique known as hidden Markov modeling (HMM), which is capable of reasonable performance in general usage (~95%) – but not much more. The major drawback is that it ignores phonetics, which has the potential for going beyond the acoustic variations to provide a more abstract underlying representation. Also, HMM only produces a s...

متن کامل

Automatic Phonetic Transcription of Non − Prompted Speech

Automatic Segmentation" (MAUS) system labels and segments the phonetic constituents of spoken German in a manner similar to highly trained phoneticians. MAUS has been used to train automatic speech recognition (ASR) systems as well as to provide detailed statistical analyses of spontaneous speech (using the Verbmobil I and RVG I corpora). The MAUS system is a reliable, automatic means of testin...

متن کامل

Phonetics and Speech Technology

Is there a need to apply phonetics in speech technology development? How can phonetic thinking influence the quality of the final product (synthesised speech, speech recognition)? What happens if phonetic aspects are not used? What branches of phonetics are used, and what is to be used in speech technology? How phonetic thinking can be embedded into the development procedure of a speech technol...

متن کامل

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

In spite of decades of research, Automatic Speech Recognition (ASR) is far from reaching the goal of performance close to Human Speech Recognition (HSR). One of the reasons for unsatisfactory performance of the state-of-the-art ASR systems, that are based largely on Hidden Markov Models (HMMs), is the inferior acoustic modeling of low level or phonetic level linguistic information in the speech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004